Make snapshot deletion faster #2

AmiStrn · 2021-02-18T22:05:05Z

The delete snapshot task takes longer than expected. A major reason for this is
that the (often many) stale indices are deleted iteratively.
In this commit we change the deletion to be concurrent using the SNAPSHOT threadpool.
Notice that in order to avoid putting too many delete tasks on the threadpool
queue a similar methodology was used as in executeOneFileSnapshot(). This is due to
the fact that the threadpool should allow other tasks to use this threadpool without
too much of a delay.

fixes issue elastic#61513 from Elasticsearch project

The delete snapshot task takes longer than expected. A major reason for this is that the (often many) stale indices are deleted iteratively. In this commit we change the deletion to be concurrent using the SNAPSHOT threadpool. Notice that in order to avoid putting too many delete tasks on the threadpool queue a similar methodology was used as in `executeOneFileSnapshot()`. This is due to the fact that the threadpool should allow other tasks to use this threadpool without too much of a delay. fixes issue elastic#61513 from Elasticsearch project

nknize · 2021-02-26T06:00:55Z

server/src/main/java/org/elasticsearch/repositories/blobstore/BlobStoreRepository.java

-                        "[{}] index {} is no longer part of any snapshots in the repository, " +
-                            "but failed to clean up their index folders", metadata.name(), indexSnId), e);
-                }
+            final int workers = Math.min(threadPool.info(ThreadPool.Names.SNAPSHOT).getMax(), staleIndicesToDelete.size());


Is it possible for staleIndicesToDelete to exceed the max threadPool size?

Yes, this is very much possible as the max threads in this threadpool is 5 and number of stale indices can be easily in the dozens if not more. This is defined in the ThreadPool class constructor, the max is: org.elasticsearch.threadpool.ThreadPool#halfAllocatedProcessorsMaxFive.
The reason we take the min of the two is in case there are indeed less than 5 deletions required.

Great! Thank you for the explanation!

AmiStrn added 2 commits February 19, 2021 00:04

fixed codeStyle errors (lines longer than 140 chars)

fb986ae

nknize reviewed Feb 26, 2021

View reviewed changes

nknize mentioned this pull request Apr 23, 2021

Make snapshot deletion faster opensearch-project/OpenSearch#147

Closed

AmiStrn closed this Aug 31, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make snapshot deletion faster #2

Make snapshot deletion faster #2

AmiStrn commented Feb 18, 2021

nknize Feb 26, 2021

AmiStrn Feb 26, 2021

nknize Feb 28, 2021

Make snapshot deletion faster #2

Make snapshot deletion faster #2

Conversation

AmiStrn commented Feb 18, 2021

nknize Feb 26, 2021

Choose a reason for hiding this comment

AmiStrn Feb 26, 2021

Choose a reason for hiding this comment

nknize Feb 28, 2021

Choose a reason for hiding this comment